23 中层视觉处理和视觉元素
我们在第 21 章和第 22 章中已经看到,眼睛不仅仅是一个相机,而是包含复杂的视网膜回路,该回路将视
网膜图像分解为表示对比度和运动的信号。这些数据通过视神经传送到初级视觉皮层,初级视觉皮层使用这些
信息来分析物体的形状。它首先识别目标的边界,由许多短线段表示,每个线段都有特定的方向。皮层然后将此
信息整合到特定目标的表示中,这一过程称为轮廓整合。
2 个步骤,方向的局部分析和轮廓整合,举例说明了视觉处理的 2 个不同阶段。局部方向计算是低层视
觉处理的一个例子,它与识别视野光结构的局部元素有关。轮廓整合是中层视觉处理的一个例子,是生成统一
视野表示的第一步。在大脑皮层分析的最早阶段,这 2 个层次的处理是一起完成的。
一个视觉场景包含成千上万的线段和表面。如图 21.1.4 所示,中层视觉处理涉及确定哪些边界和表面属
特定目标,哪些是背景的一部分。它还涉及从表面反射的光的强度和波长区分表面的亮度和颜色。反射光的物
理特性既取决于照亮表面的光的强度和颜色平衡,也取决于该表面的颜色。确定单个物体的实际表面颜色需要
比较场景中多个表面反射的光的波长。
因此,中层视觉处理涉及将图像的局部元素组合成对物体和背景的统一感知。虽然确定哪些元素属于一个
单一目标是一个非常复杂的问题,具有天文数字的潜在解决方案,但大脑视觉回路中的每个中继都有内置逻辑,
允许对元素之间可能的空间关系做出假设。如图 23.0.1 所示,在某些情况下,这些固有规则会导致视野中实
上不存在的轮廓和表面的错觉。
视觉处理 3 特征有助于克服来自视网膜的信号的歧义。首先,感知视觉特征的方式取决于它周围的
切。例如,对点、线或表面的感知取决于该特征与场景中存在的其他事物之间的关系。也就是说,视觉皮层中神
经元的反应是依赖于上下文的:它取决于细胞感受野外的轮廓和表面的存在,以及细胞感受野内的属性。其次,
视觉皮层神经元的功能特性可以通过视觉经验或知觉学习来改变。最后,皮层中的视觉处理受认知功能的影响,
特别是注意力、期望和“感知任务”(积极参与视觉辨别或检测)。这 3 个因素(表示场景的上下文或整套信号、
皮层回路中经验依赖性变化以及期望)之间的相互作用对于视觉系统对复杂场景的分析至关重要。
在本章中,我们将研究大脑对视觉场景中局部特征或视觉原始要素的分析如何与对更多全局特征的分析并
行进行。视觉原始要素包括对比度、线方向、亮度、颜色、运动和深度。每种类型的视觉原始要素都受到中间
级处理的综合作用。具有特定方向的线被整合到物体轮廓中,局部对比度信息被整合到表面亮度和表面分割中,
波长选择性被整合到颜色恒常性中,方向选择性被整合到物体运动中。
视觉原始要素的分析从视网膜开始,检测亮度和颜色,然后在初级视觉皮层继续分析方位、运动方向和立
体深度。与中层视觉处理相关的属性与从初级视觉皮层开始的视觉皮层中的视觉原始要素一起分析,它在轮廓
整合和表面分割中发挥作用。视觉皮层的其他区域专注于这项任务的不同方面:如图 23.0.2 所示,二级视觉
分析与物体表面相关的属性,四级视觉皮层整合关于颜色和物体形状的信息,而级视觉皮(颞中区或
侧颞叶)整合空间中的运动信号。
23.1 物体几何内部模型帮助大脑分析形状
确定目标轮廓的第一步是识别轮廓局部部分的方向。此步骤从初级视觉皮层开始,该步骤在局部和全局形
式分析中都起着至关重要的作用。
视觉皮层中的神经元对视野的特定局部特征有选择性响应,包括方向,双眼差或深度,运动方向以及在视网
膜和外侧膝状核中已经分析的特性,例如对比度和颜色。方向选择性,这是 1959 大卫 · 休伯托斯坦 ·
泽尔在皮层神经元的感受野中发现的第一个新兴特性。
视网膜(第 22 章)和外侧膝状核(第 21 章)中的神经元都有具有中心周围组织的圆形感受野。如图 21.4.2
示,它们响应视野中边缘或线条的光线对比度,但对这些边缘的方向没有选择性。然而,在视觉皮层中,神经元
对特定方向的线有选择性反应。每个神经元对狭窄的方向响应,大约 40°并且不同的神经元对不同方向的反应
最佳。休伯泽尔出,这种方向选择性反映了从外侧膝状核中输入的排列,现在有大量支持该思想的支
23.1 物体几何内部模型帮助大脑分析形状
546 Part IV / Perception
Figure 23–1 Illusory contours and perceptual fill-in. The vis-
ual system uses information about local orientation and contrast
to construct the contours and surfaces of objects. This construc-
tive process can lead to the perception of contours and surfaces
that do not appear in the visual field, including those seen in
illusory figures. Top left: In the Kanizsa triangle illusion, one per-
ceives continuous boundaries extending between the apices of
a white triangle, even though the only real contour elements are
those formed by the Pac-Man–like figures and the acute angles.
Top right: The inside and outside of the illusory pink square are
the same white color as the page, but a continuous transparent
pink surface within the square is perceived. Bottom: Occlud-
ing surfaces can also facilitate contour integration and surface
segmentation. The irregular shapes on the left appear to be
unrelated, but when they are partially occluded by black shapes
(right), they are easily seen as fragments of the letter B.
allows assumptions to be made about the likely spatial
relationships between elements. In certain cases, these
inherent rules can lead to the illusion of contours and
surfaces that do not actually exist in the visual field
(Figure 23–1).
Three features of visual processing help overcome
ambiguity in the signals from the retina. First, the way
in which a visual feature is perceived depends on eve-
rything that surrounds it. The perception of a point,
line, or surface, for example, depends on the relation-
ship between that feature and what else is present
in the scene. That is, the response of a neuron in the
visual cortex is context-dependent: It depends as much
on the presence of contours and surfaces outside the
cell’s receptive field as on the attributes within it.
Second, the functional properties of neurons in the
visual cortex can be altered by visual experience or
perceptual learning. Finally, visual processing in the
cortex is subject to the influence of cognitive functions,
specifically attention, expectation, and “perceptual
task (the active engagement in visual discrimination or
detection). The interaction between these three factors—
the context or entire set of signals representing a scene,
experience-dependent changes in cortical circuitry,
Kandel-Ch23_0545-0563.indd 546 09/12/20 4:50 PM
23.0.1
: 虚幻的轮廓和感性的填充。视觉系统使用关于局部方向和对比度的信息来构建物体的轮廓和表面。
种构建性的过程可以导致对没有出现在视野中的轮廓和表面的感知,包括在虚幻的图形中看到的轮廓和曲面。
上角:在卡尼萨三角形错觉中,人们可以感知到白色三角形顶点之间延伸的连续边界,尽管唯一真正的轮廓元
素是由吃豆人形状和锐角形成的。右上角:虚幻的粉红色正方形的内部和外部与页面的白色相同,但可以看到
正方形内连续的透明粉红色表面。底部:遮挡曲面也可以促进轮廓集成和曲面分割。左边的不规则形状看起来
是不相关的,但当它们被黑色形状(右边)部分遮挡时,它们很容易被视为字母 B 的碎片。
479
23.1 物体几何内部模型帮助大脑分析形状
Chapter 23 / Intermediate-Level Visual Processing and Visual Primitives 547
Figure 23–2 Cortical areas involved with intermediate-level
visual processing.Many cortical areas in the macaque mon-
key, including V1, V2, V3, V4, and middle temporal area (MT),
are involved with integrating local cues to construct contours
and surfaces and segregating foreground from background.
The shaded areas extend into the frontal and temporal lobes
because cognitive output from these areas, including attention,
expectation, and behavioral task, contributes to the process of
scene segmentation. (Abbreviations: AIP, anterior intraparietal
cortex; FEF, frontal eye fields; IT, inferior temporal cortex; LGN,
lateral geniculate nucleus; LIP, lateral intraparietal cortex; MD,
medial dorsal nucleus of thalamus; MIP, medial intraparietal
cortex; MST, medial superior temporal cortex; MT, middle tem-
poral cortex; PF, prefrontal cortex; PL, pulvinar; PMd, dorsal
premotor cortex; PMv, ventral premotor cortex; SC, superior
colliculus; TEO, occipitotemporal cortex; VIP, ventral intrapa-
rietal cortex; V1, V2, V3, V4, primary, secondary, third, and
fourth visual areas.)
and expectation—is vital to the visual system’s analy-
sis of complex scenes.
In this chapter, we examine how the brain’s
analysis of the local features in a visual scene, or
visual primitives, proceeds in parallel with the analy-
sis of more global features. Visual primitives include
contrast, line orientation, brightness, color, movement,
and depth. Each type of visual primitive is subject to
the integrative action of intermediate-level processing.
Lines with particular orientations are integrated into
object contours, local contrast information into sur-
face brightness and surface segmentation, wavelength
selectivity into color constancy, and directional selec-
tivity into object motion.
The analysis of visual primitives begins in the
retina with the detection of brightness and color and
continues in the primary visual cortex with the analy-
sis of orientation, direction of movement, and stereo-
scopic depth. Properties related to intermediate-level
visual processing are analyzed together with visual
primitives in the visual cortex starting in the primary
visual cortex (V1), which plays a role in contour inte-
gration and surface segmentation. Other areas of the
visual cortex specialize in different aspects of this task:
V2 analyzes properties related to object surfaces, V4
integrates information about color and object shape,
and V5—the middle temporal area or MT—integrates
motion signals across space (Figure 23–2).
前馈
反馈
PMd
PMv
PF
FEF
AIP
MT/
MST
MD
PL
LGN
TEO
IT
SC
VIP
初级视
觉皮层
V3
V2
V4
背侧通路
腹侧通路
MIP
LIP
Internal Models of Object Geometry Help the
Brain Analyze Shapes
A first step in determining an object’s contour is identi-
fication of the orientation of local parts of the contour.
This step commences in V1, which plays a critical role
in both local and global analysis of form.
Neurons in the visual cortex respond selectively
to specific local features of the visual field, including
orientation, binocular disparity or depth, and direc-
tion of movement, as well as to properties already ana-
lyzed in the retina and lateral geniculate nucleus, such
as contrast and color. Orientation selectivity, the first
emergent property identified in the receptive fields of
cortical neurons, was discovered by David Hubel and
Torsten Wiesel in 1959.
Neurons in both the retina (Chapter 22) and the
lateral geniculate nucleus (Chapter 21) have circular
receptive fields with a center-surround organization.
They respond to the light–dark contrasts of edges or
lines in the visual field but are not selective for the ori-
entations of those edges (see Figure 21–9). In the visual
cortex, however, neurons respond selectively to lines
of particular orientations. Each neuron responds to a
narrow range of orientations, approximately 40°, and
different neurons respond optimally to distinct orien-
tations. Hubel and Wiesel proposed that this orienta-
tion selectivity reflects the arrangement of the inputs
Kandel-Ch23_0545-0563.indd 547 09/12/20 4:50 PM
23.0.2: 涉及中层水平视觉处理的皮层区域。猕猴的许多皮层区域,包括初级视觉皮层二级视觉皮层三级
视觉皮层四级视觉皮层内侧颞叶都参与整合局部线索以构建轮廓和表面,并将前景与背景分离。阴影区域
延伸到额叶和颞叶,因为这些区域的认知输出,包括注意力、期望和行为任务,有助于场景分割过程。
持证据。如 23.1.1 示,每个初级视觉皮层神经元都从几个相邻的膝状核神经元中接收输入,这些神经元
中心周围感受野是对齐的,以表示特定的方向轴。已经确定了 2 种主要的方向选择性神经元的类型。
如图 23.1.2 所示,简单细胞的感受野分为 ON OFF 子域。当视觉刺激(例如光线)进入感受野的 ON
域时,神经元会发放动作电位。当光条离开 OFF 子域时,该细胞也会做出响应。简单细胞对移动光条具有特征
性响应。当一条光条从 OFF 子域移动到 ON 子域时,它们会迅速活跃。因此,这些细胞的反应对于空间中线条
或边缘的位置具有很高的选择性。
复杂细胞对目标边界的位置的选择性较小。如图 23.1.2 所示,他们缺乏明确的 ON OFF 子域,并且在其
感受野的所有位置都对光和暗做出了类似的反应。他们随着线或边缘刺激的遍历而持续活跃。伯尔威泽尔
提出,复杂的细胞是简单感受野之后阐述的第二阶段,并且是通过重叠的简单感受野构建的。
当人们考虑到早期视皮层区域中描述的感受野特性的范围时,重要的是要指出系统发育的差异,不同物种
在这些特性首次表达的位置和表现的特性种类方面存在差异。在猫的视觉皮层中,外侧膝状核神经元的目标层
具有定向的简单细胞。据推测,这些皮层细胞代表了视觉信息的皮层处理中的一个必要的第一阶段,位于外侧
膝状核中心-周围圆对称感受野和皮层表浅层复杂细胞的感受野之间。然而,在灵长类动物中,膝状核的目标层
4Cα β 有圆形的对称,无定向的感受野。4C 细胞的后突触靶主要是皮层的表层,并用复杂的细胞填充,
因此跳过了一个简单的细胞阶段。在小鼠中,在外侧膝状核中可以看到方向选择性。前面的比较指出了视觉处
理演变的一些特征。一个是功能的脑化,其中诸如方向之类的属性转移到了进化阶段的处理阶段。另一个是新
途径的发展。有人提出,灵长类动物的大细胞通路相当于猫的整个膝状纹状体通路,而小细胞通路,介导更高分
辨率的视觉和彩色视觉,对灵长类动物是新的。
运动刺激通常用于研究视觉皮层神经元的感受野,不仅用于模拟检测到空间中的物体在空间中移动的条件,
还可以模拟眼睛运动产生的条件。当我们扫描视觉环境时,静止物体的边界会在视网膜上移动。实际上,视觉感
知需要眼动。视觉皮层神经元对在视网膜上稳定的图像没有反应。这些神经元需要瞬时刺激(移动或闪烁刺激)
才能激活。
一些视觉皮层神经元具有感受野,其中兴奋中心的两侧是抑制区域。如图 23.1.3 所示,沿着方向轴的抑
区域(即所谓的末端抑制性)限制了神经元对一定长度线的响应。末端抑制性神经元对不延伸到抑制性区域但
完全位于兴奋性部分的感受野内的线条响应很好。由于抑制区具有中央兴奋性区域的取向偏好,因此末端抑制
性细胞对线曲率有选择性,并且对角的反应也很好。
480
23.2 深度感知有助于将物体与背景分离
548 Part IV / Perception
Figure 23–3 Orientation selectivity and mechanisms.
A.A neuron in the primary visual cortex responds selectively to
line segments that fit the orientation of its receptive field. This
selectivity is the first step in the brains analysis of an object’s
form. (Reproduced, with permission, from Hubel and Wiesel
1968. Copyright © 1968 The Physiological Society.)
B.The orientation of the receptive field is thought to result from
the alignment of the circular center-surround receptive fields of
several presynaptic cells in the lateral geniculate nucleus. In the
monkey, individual neurons in layer IVCβ of V1 have unoriented
receptive fields. However, when several neighboring IVCβ cells
project to a neuron in layer IIIB they create a receptive field with
a specific orientation for that postsynaptic cell.
from the lateral geniculate nucleus, and there is now
a body of supportive evidence for the idea. Each
V1 neuron receives input from several neighboring
geniculate neurons whose center-surround receptive
fields are aligned so as to represent a particular axis
of orientation (Figure 23–3). Two principal types of
orientation-selective neurons, simple and complex,
have been identified.
Simple cells have receptive fields divided into ON and
OFF subregions (Figure 23–4). When a visual stimulus
such as a bar of light enters the receptive field’s ON sub-
region, the neuron fires; the cell also responds when the
bar leaves the OFF subregion. Simple cells have a charac-
teristic response to a moving bar; they discharge briskly
when a bar of light leaves an OFF region and enters an
ON region. The responses of these cells are therefore
highly selective for the position of a line or edge in space.
Complex cells are less selective for the position of
object boundaries. They lack discrete ON and OFF
subregions (Figure 23–4) and respond similarly to light
and dark at all locations across their receptive fields.
They fire continuously as a line or edge stimulus trav-
erses their receptive fields. Hubel and Wiesel proposed
that the complex cells are a second stage of the elabora-
tion of receptive fields after simple receptive fields and
are built by overlapping simple receptive fields.
III
B
IVCβ
神经元 感受野
皮层
AB
As one considers the range of receptive field prop-
erties that have been described in the early visual cor-
tical areas, it is important to point out phylogenetic
differences, with different species differing in the loca-
tion in which these properties are first expressed and in
the kinds of properties that are represented. In the cat,
the target layer of the visual cortex for lateral genicu-
late neurons has oriented simple cells; it had been pre-
sumed that these cortical cells represent an obligatory
first stage in the cortical processing of visual informa-
tion, between the center-surround circularly symmet-
ric receptive fields in the lateral geniculate nucleus and
the receptive fields of complex cells in the superficial
cortical layers. In primates, however, the geniculate
target layers, 4Cα and β, have circularly symmetric,
unoriented receptive fields. The postsynaptic target of
the layer 4C cells, predominantly the superficial layers
of the cortex, is populated with complex cells, there-
fore skipping a simple cell stage. In the mouse, orientation
selectivity is seen in the lateral geniculate nucleus. The
preceding comparison points out a few characteris-
tics of the evolution of visual processing. One is the
encephalization of function, where properties such
as orientation are shifted to later stages of process-
ing over stages of evolution. Another is the develop-
ment of new pathways. It has been suggested that the
Kandel-Ch23_0545-0563.indd 548 09/12/20 4:50 PM
23.1.1: 方向选择性和机制。A. 初级视觉皮层中的神经元选择性地响应适合其感受野方向的线段。这种选择性
是大脑分析物体形状的第一
[192]
B. 感受野的方向被认为是由几个突触前细胞的圆形中心-绕感受野对齐的
结果。膝状核。在猴子中,初级视觉皮层 IVCβ 中的单个神经元具有无方向的感受野。然而,当几个相邻
IVCβ 细胞投射到 IIIB 层中的神经元时,它们会为该突触后细胞创建一个具有特定方向的感受野。
为了定义整个目标的形状,视觉系统必须将有关局部方向和曲率的信息集成到目标轮廓中。如图 23.1.4
示,视觉系统整合轮廓的方式反映了自然世界中存在的几何关系。正如心理学家格式塔最初在 20 世纪初指出的
那样,立即识别的轮廓往往遵循良好的延续规则(弯曲线保持稳定的曲率半径和直线的持续性)在复杂的视觉
场景中,这种平滑的轮廓往往会“凸显出来”,而更多的锯齿状轮廓很难检测到。
视觉皮层神经元的反应可以通过刺激来调节,刺激本身不会激活细胞,因此位于感受野的核心之外。这种
上下文调节赋予神经元具有更复杂刺激的选择性,而不是通过将刺激的成分放置在感受野和周围的不同位置来
预测的。如图 23.1.4a 所示,相同的视觉特征,可以促进复杂场景中目标检测也适用于上下文调节。赋予轮廓感
感知的特征的特性,甚至是虚幻的特征,都反映在初级视觉皮层中神经元的响应中,这些神经元对轮廓的全局
特征敏感,甚至那些延伸在其感受野之外的轮廓。
如图 23.1.4b 所示,视觉空间大区域的上下文影响可能是由具有相似方向选择性的视觉皮层中多个神经元之
间的连接介导的。如图 21.6.2 所示,这些连接是由平行于皮层表面的锥体轴突形成的。如图 21.5.3 所示,这些水
平连接的程度和方向依赖性提供了可以介导轮廓显著性的相互作用。
轮廓整合过程的核心是关联区域的思想。关联区域是指感知将轮廓元素链接到全局轮廓所需的视觉空间之
间的交互。它奠定了良好延续的格式塔原理,以及嵌入复杂场景中的平滑轮廓的感知显著性。从生理上讲,它
在一定程度上解释了轮廓元素延伸到其“经典”感受野之外时的神经元响应促进现象。从解剖学上讲,它部分
是由远程水平连接与皮层功能体系结构之间的关系所介导的。尽管在初级视觉皮层中对其进行了最广泛的研究,
但由于皮层所有区域的水平连接无处不在,因此它很可能是关联每个皮层区域中映射的信息位的策略。关联区
域在初级视觉皮层以外的皮层区域的功能作用取决于信息在皮层表面的映射以及这些映射与水平连接复杂网络
之间的关系。
481
23.2 深度感知有助于将物体与背景分离
简单细胞
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
复杂细胞
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
23.1.2: 视觉皮层中的简单和复杂细胞。简单细胞的感受野被分成具有相反响应特性的子域。在给光子域(用
+ 表示)中,光的开始触发神经元的反应;在撤光子场(由 - 表示)中,光条的熄灭触发响应。复杂细胞具有重
叠的 ON OFF 区域,并且当线或边缘沿着垂直于感受野方向的轴穿过感受野时连续响应。
482
23.2 深度感知有助于将物体与背景分离
550 Part IV / Perception
Figure 23–5 End-inhibited receptive
fields.Some receptive fields have a
central excitatory region flanked by
inhibitory regions that have the same
orientation selectivity. Thus, a short
line segment or a long curved line will
activate the neuron (A and C), but a
long straight line will not (B). A neuron
with a receptive field that displays only
one inhibitory region in addition to the
excitatory region can signal the pres-
ence of corners (D).
细胞响应
刺激
感受野
AB
CD
to the interactions across visual space required to per-
ceptually link contour elements into global contours.
It underlies the Gestalt principle of good continuation
and the perceptual saliency of smooth contours embed-
ded in complex scenes. Physiologically, it underlies the
facilitation of neuronal responses by contour elements
extending outside their “classical” receptive fields.
Anatomically, it is mediated in part by the relation-
ship between long-range horizontal connections and
cortical functional architecture. Though it has been
investigated most extensively in primary visual cor-
tex, because of the ubiquity of horizontal connections
across all areas of cortex, it is likely to be a strategy for
associating bits of information that are mapped within
every cortical area. The functional role of the associa-
tion field in cortical areas outside of V1 depends on
how information is mapped across the cortical sur-
face and the relationship between these maps and the
plexus of horizontal connections.
Depth Perception Helps Segregate Objects
From Background
Depth is another key feature in determining the per-
ceived shape of an object. An important cue for the
perception of depth is the difference between the two
eyes’ views of the world, which must be computed and
reconciled by the brain. The integration of binocular
input begins in the primary visual cortex, the first level
at which individual neurons receive signals from both
eyes. The balance of input from the two eyes, a property
known as ocular dominance, varies among cells in V1.
Binocular neurons in many visual cortical areas
are also selective for depth, which is computed from
the relative retinal positions of objects placed at differ-
ent distances from the observer. An object that lies in
the plane of fixation produces images at corresponding
positions on the two retinas (Figure 23–7). The images
of objects that lie in front of or behind the plane of fixa-
tion fall on slightly different locations in the two eyes,
a property known as binocular disparity. Individual
neurons can be selective for a narrow range of dispari-
ties and therefore positions in depth. Some are selec-
tive for objects lying on the plane of fixation (tuned
excitatory or inhibitory cells), whereas others respond
only when objects lie in front of the plane of fixation
(near cells) or behind that plane (far cells).
Depth plays an important role in the perception
of object shape, in surface segmentation, and in estab-
lishing the three-dimensional properties of a scene.
Objects that are placed near an observer can partially
occlude those situated farther away. A surface pass-
ing behind an object is perceived as continuous even
though its two-dimensional image on each retina rep-
resents two surfaces separated by the occluder. When
the brain encounters a surface interrupted by gaps that
have appropriate alignment and contrast, and lying in
the near-depth plane, it fills in the gaps to create a con-
tinuous surface (Figure 23–8).
Although the depth of a single object can be
established easily, determining the depths of multiple
objects within a scene is a much more complex problem
that requires linking the retinal images of all objects
in the two eyes. The disparity calculation is therefore
a global one: The calculation in one part of the visual
image influences the calculation for other parts. When
the assignment of depth is unambiguous in one part
of an image, that information is applied to other parts
of the image where there is insufficient information to
Kandel-Ch23_0545-0563.indd 550 09/12/20 4:50 PM
23.1.3: 末端抑制感受野。一些感受野有一个中央兴奋区,两侧是具有相同方向选择性的抑制区。因此,短线
段或长曲线将激活神经元(A C,但长直线不会(B。具有感受野的神经元除兴奋区外仅显示一个抑制区,
可以发出角点(D)存在的信号。
23.2 深度感知有助于将物体与背景分离
深度是确定目标感知形状的另一个关键特征。深度感知的一个重要线索是两只眼睛对世界的视图之间的差
异,必须由大脑计算和调和。双眼输入的整合始于初级视觉皮层,这是单个神经元从两只眼睛接收信号的第一
级。在初级视觉皮层中的细胞中,两只眼睛的输入的平衡(一种称为眼优势)的平衡各不相同。
许多视觉皮层区域中的双眼神经元也对深度具有选择性,这是根据位于与观察者不同距离的物体的相对视
网膜位置计算得出的。如图 23.2.1 所示,位于固定平面的目标会 2 个视网膜上的相应位置产生图像。位于固
定平面前或后面的物体的图像落在两只眼睛的略有不同位置,这是一种被称为双眼差异的特性。个体神经元可
以选择性地对一小范围的视差(深度位置)进行选择。有些是对位于固定平面上的物体(调谐兴奋性或抑制性
细胞)的选择性,而另一些物体仅当物体位于固定平面(近细胞)或该平面后面(远细胞)时才响应。
深度在目标形状,表面分割以及建立场景的三维特性中起着重要作用。放置在观察者附近的物体可以部分
阻塞那些位于较远的地方。即使在每个视网膜上的二维图像代表封闭器分隔的 2 个表面,也将传递在物体后
的表面被认为是连续的。如 23.2.2 示,当大脑遇到被带有适当对准和对比的间隙中断的表面,并位于近
处的平面中时,它填补了间隙以形成连续的表面。
尽管可以轻松建立一个目标的深度,但是确定场景中多个目标的深度是一个更复杂的问题,需要将两只眼
中所有对象的视网膜图像链接起来。因此,差异计算是一个全局:视觉图像的一个部分的计算会影响其他部分
的计算。当深度分配在图像的一个部分中是明确的,该信息将应用于图像的其他部分,那里的信息不足以确定
深度,这是一种称为差异捕获的现象。
随机点立体图为全局差异分析范围提供了戏剧性的证明。如图 23.2.2C 所示,显示给每只眼睛的视觉信息似
乎是不连贯的,但是当立体图被双眼查看时,2 个图像中的点随机阵列之间的差异允许嵌入形状可见。此感知为
基础的计算并不简单,但需要确定左眼显示的哪些功能对应于右眼看到的特征,并在整个图像中传播局部差异
信息。
二级视觉皮层中的神经元表现出对整体视差线索表现出敏感性。如图 23.2.2B 所示,远处的深度提示可用于
链接属于目标的轮廓元素,并将它们与目标背景区分开。
除了双眼差异外,视觉系统还使用许多单眼线索来区分深度。通过单眼线索的深度确定,例如大小,透视,
483
23.2 深度感知有助于将物体与背景分离
Chapter 23 / Intermediate-Level Visual Processing and Visual Primitives 551
Figure 23–6 Contour integration reflects the perceptual
rules of proximity and good continuation.(Adapted, with
permission, from Li and Gilbert 2002.)
A.A straight line composed of one or more contour elements
with the same oblique orientation appears in the center of
each of the four images here. In some images, the line pops
out more or less immediately, without searching. Factors that
contribute to contour saliency include the number of contour
elements (compare the first and second frames), the spacing of
the elements (third frame), and the smoothness of the contour
(bottom frame). When the spacing between elements is too
large or the orientation difference between them too great, one
must search the image to find the contour.
B.These perceptual properties are reflected in the horizontal con-
nections between columns of V1 neurons with similar orientation
selectivity. As long as the visual elements are spaced sufficiently
close together, excitation can propagate from cell to cell, thus
facilitating the responses of V1 neurons. Each neuron in the net-
work then augments the responses of neurons on either side,
and the facilitated responses propagate across the network.
轮廓平滑度
轮廓平滑度
A 视野
B 横向连接的 V1 神经元
影响轮廓显著性的特征
线条元素的个数
Kandel-Ch23_0545-0563.indd 551 09/12/20 4:50 PM
23.1.4: 轮廓整合反映了接近和良好连续的感知规则
[193]
A. 一条直线由一个或多个具有相同倾斜方向的轮廓
元素组成,出现在此处四个图像中每一个的中心。在某些图像中,线条或多或少会立即弹出,无需搜索。影响轮
廓显著性的因素包括轮廓元素的数量(比较第一帧和第二帧)、元素间距(第三帧)和轮廓的平滑度(底部帧)
当元素之间的间距太大或它们之间的方向差异太大时,必须搜索图像以找到轮廓。B. 这些感知特性反映在具有
相似方向选择性的初级视觉皮层神经元柱之间的水平连接中。只要视觉元素之间的距离足够近,兴奋就可以从
一个细胞传播到另一个细胞,从而促进初级视觉皮层神经元的反应。然后,网络中的每个神经元都会增强任一
侧神经元的响应,并且促进的响应会在整个网络中传播。
484
23.2 深度感知有助于将物体与背景分离
552 Part IV / Perception
Figure 23–7 Stereopsis and binocular disparity.
A.Depth is computed from the positions at which images
occur in the two eyes. The image of an object lying in the
plane of fixation (green) falls on corresponding points on the
two retinas. Images of objects lying in front of the plane of
fixation (blue) or behind it (yellow) fall on noncorresponding
locations on the two retinas, a phenomenon termed binocular
disparity.
B.Neurons in many visual cortical areas are selective for par-
ticular ranges of disparity. Each plot shows the responses of a
neuron to binocular stimuli with different disparities (abscissa).
Some neurons are tuned to a narrow range of disparities and
thus have particular disparity preferences (tuned excitatory or
tuned inhibitory neurons), whereas others are tuned broadly for
objects in front of the fixation plane (near cells) or beyond the
plane (far cells). (Adapted, with permission, from Poggio 1995.
Copyright © 1995 Oxford University Press.)
注视平面
中央凹
A 视网膜图像的双目视差 B 视差选择性神经元
–1.0 1.0
0–1.0 1.0
–1.0 1.0
0
调谐抑制细胞
0
水平视差(度)
近细胞和远细胞
调谐兴奋细胞
determine depth, a phenomenon known as disparity
capture.
Random-dot stereograms provide a dramatic dem-
onstration of the global scope of disparity analysis. The
visual information presented to each eye appears to be
incoherent, but when the stereogram is viewed binocu-
larly, the disparity between the random array of dots in
the two images allows an embedded shape to become
visible (Figure 23–8C). The calculation underlying this
percept is not simple, but requires determining which
features shown to the left eye correspond to features
seen by the right eye and propagating local disparity
information across the image.
Neurons in area V2 display sensitivity to global
disparity cues. Distant depth cues can be used to link
contour elements that belong to an object, and to sepa-
rate them from the object background (Figure 23-B).
In addition to binocular disparity, the visual sys-
tem also uses many monocular cues to discriminate
depth. Depth determination through monocular
Kandel-Ch23_0545-0563.indd 552 09/12/20 4:50 PM
23.2.1: 立体视觉和双眼视差。A. 深度是根据图像在两只眼睛中出现的位置计算的。位于固定平面(绿色)
的物体的图像落在 2 个视网膜上的相应点上。位于固定平面前面(蓝色)或后面(黄色)的物体的图像落在 2
视网膜上的不对应位置,这种现象称为双眼视差。B. 许多视觉皮层区域的神经元对特定的视差范围具有选择性。
每个图显示神经元对具有不同视差(横坐标)的双眼刺激的反应。一些神经元被调谐到一个狭窄的差异范围,
此具有特定的差异偏好(调谐兴奋性或调谐抑制性神经元)而其他神经元则针对固定平面(近细胞)或平面外
(远细胞
1
)前面的物体进行广泛调整
[194]
485
23.2 深度感知有助于将物体与背景分离
A
A
23.2.2: 双眼视差的全局分析。A. 1. 深度线索有助于表面分割。如果您查看 3 个灰色垂直条穿过灰色水平矩形
的图像之一,您会在矩形内看到一个均匀的灰色区域。2. 而,如果你融合 2 个眼睛发散的矩形,3 个垂直条
落在 2 个视网膜上,视差为近、零和远。这样看,左边的条形图似乎悬停在矩形前面,有一条虚幻的垂直边
穿过矩形,而右边的条形图似乎位于水平矩形边缘的后面。B. 二级视觉皮层的神经元对双眼视差线索形成的虚
幻边缘做出反应。当细胞的感受野以灰色正方形为中心时,细胞不响应与正方形有远差异或相同差异的垂直条。
当垂直条具有接近差异时,细胞会在虚幻的垂直边缘穿过其感受野时做出反应
[195]
C. 随机点立体图被视为彩色
点的随机阵列,直到你的眼睛发散或汇聚,使相邻的深色垂直条纹对齐,产生从背景中出现的鲨鱼的三维图像。
这种效应源于所选点集的系统差异。
486
23.3 局部运动线索定义目标轨迹和形状
遮挡,亮度和运动,并不困难。另一个起源于视觉系统之外的线索是会聚,即不同距离物体的双眼光轴之间的角
度。另一种被称为达立体视觉的双眼线索是一只眼睛可见但被另一只眼睛遮挡的特征。
初级视觉皮层和二级视觉皮层中的神经元也表示前景-背景关系。即使该表面的边界远离感受野的边界,在
较大表面内具有感受野中心的细胞也可能响应。这种响应有助于区分目标的背景。在理解图像时,大脑必须确
定哪个边缘属于哪个目标并将每个目标的边缘与背景区分开。如图 23.2.3 所示,级视觉皮中的某些单元
“边框所有权”的属性,仅在图形而不是背景的情况下触发,即使在 2 种情况下局部边缘信息相同时,也是对
边缘的一侧。
554 Part IV / Perception
Figure 23–9 Border ownership.Cells in area V2 are
sensitive to the boundaries of whole objects. Even though
the local contrast is the same for the two rectangles within a
cell’s receptive field, the cell responds only when the bound-
ary is part of the full rectangle that lies on the preferred side
of the receptive field. (Adapted, with permission, from Zhou,
Friedman, and von der Heydt 2000. Copyright © 2000 Society
for Neuroscience.)
cues, such as size, perspective, occlusion, brightness,
and movement, is not difficult. Another cue that
originates outside the visual system is vergence, the
angle between the optical axes of the two eyes for
objects at varying distances. Yet another binocular
cue, known as DaVinci stereopsis, is the presence of
features visible to one eye but occluded in the other
eye’s view.
Neurons in areas V1 and V2 also signal fore-
ground–background relationships. A cell with its
receptive field in the center of a pattern within a
larger surface may respond even when the bound-
ary of that surface is distant from the receptive field.
This response helps differentiate the object from its
background. In making sense of an image, the brain
must identify which edge belongs to which object and
differentiate the edge of each object from the back-
ground. Some cells in area V2 have the property of
“border ownership,” firing only when a figure but
not the background is to one side of the edge, even
when the local edge information is identical in both
instances (Figure 23–9).
Local Movement Cues Define Object Trajectory
and Shape
The primary visual cortex determines the direction of
movement of objects. Directional selectivity in neurons
likely involves sequential activation of regions on dif-
ferent sides of the receptive field.
If an object moving at an appropriate velocity
first encounters a region of a neuron’s receptive field
with long response latencies and then passes into
regions with progressively shorter latencies, signals
from throughout the receptive field will arrive at the
cell simultaneously and the neuron will fire vigor-
ously. If the object moves in the opposite direction,
signals from the different regions will not summate
and the cell may never reach the threshold for firing
(Figure 23–10).
Early in the visual pathways, analysis of the move-
ment of an object is limited by the size of the recep-
tive fields of the sensory neurons. Even in the initial
cortical areas V1 and V2, the receptive fields of neu-
rons are small and might encompass only a fraction of
an object. Eventually, however, information about the
direction and speed of movement of discrete aspects
of an object must be integrated into a computation of
the movement of a whole object. This problem is more
difficult than one might expect.
If one observes a complex shape moving through
a small aperture, the part of the object’s boundary
细胞感受野
V2 细胞响应
目标位于细胞感受野的右侧(首选)
目标
位于细胞感受野左侧
within the aperture appears to move in a direction
perpendicular to the boundary’s orientation (Figure
23–11A). One cannot detect a line’s true direction of
movement if the line’s ends are not visible. The image
of a line appears the same if it is moving slowly
along an axis perpendicular to its orientation or more
quickly along an oblique axis. This is the quandary
presented by the receptive field of a V1 neuron. The
visual system’s solution is to assume that the move-
ment of a contour is perpendicular to its orientation.
Thus, an object is first presented to the visual system
in countless small pieces with boundaries of differ-
ent orientations, all of which appear to be moving in
Kandel-Ch23_0545-0563.indd 554 09/12/20 4:50 PM
23.2.3: 边界所有权。二级视觉皮层中的细胞对整个目标的边界敏感。尽管细胞感受野内的 2 个矩形的局部对
比度相同,但只有当边界是位于感受野偏好侧完整矩形的一部分时,细胞才会响应
[196]
487
23.3 局部运动线索定义目标轨迹和形状
23.3 局部运动线索定义目标轨迹和形状
主要的视觉皮层确定目标运动的方向。神经元中的方向选择性可能涉及感受野不同侧面区域的顺序激活。
如果以适当的速度移动的物体首先遇到具有长响应潜伏期的神经元感受野的区域,然后进入逐渐较短的潜
伏期的区域,则整个感受野的信号将同时到达细胞,而神经元将剧烈发射。如图 23.3.1 所示,如果目标朝相反的
方向移动,则来自不同区域的信号不会汇总,并且细胞可能无法达到触发阈值。
在视觉通路的早期,对物体运动的分析受感觉神经元的感受野的大小的限制。即使在最初的皮层区域初级
视觉皮层和二级视觉皮层中,神经元的感受野也很小,并且可能仅包含一个物体的一小部分。然而,最终必须将
有关目标离散方面运动的方向和速度集成到整个目标运动的计算中。这个问题比人们预期的要困难得多。
如图 23.3.2a 所示,如果一个人观察到一个复杂的形状穿过小孔,则光圈内物体边界的一部分似乎沿垂直于
边界方向的方向移动。如果线的末端不可见,则无法检测到线的真实运动方向。如果线的图像沿着垂直于其方
向的轴缓慢移动,或者沿斜轴迅速移动,则该线的图像看起来相同。这是初级视觉皮层神经元的感受野提出的
难题。视觉系统的解决方案是假设轮廓的运动垂直于其方向。因此,如图 23.3.2a 所示,首先将一个物体呈现给
视觉系统,其中无数小块具有不同方向的边界,所有这些似乎都以不同的方向和不同速度移动。
确定目标的运动方向需要解决多个提示。可以通过将一个栅格放在另一个上方并将 2 个方向移动到不同
向上,可以很容易地证明这一点。如图 23.3.2b 所示,所得的棋盘格态似乎在单个光栅的轨迹之间沿中间方向移
动。这种感知取决于光栅的相对对比和光栅重叠的区域。由于相对对比很大,光栅似乎相互滑动,朝着他们的各
个方向移动,而不是朝着共同的方向移动。
感知方向的一个重要决定因素是场景分割,将移动元素分离到前景和背景。在具有移动目标的场景中,分
割不是基于方向的本地提示。相反,对方向的感知取决于场景细分。理发界的幻觉提供了另一个例子,说明了全
球关系在简单属性的感知上占主导地位。如图 23.3.2c 所示,旋转条纹被认为是沿极长轴垂直移动的。视野中运
动的感知使用了一种复杂的算法,该算法将本地运动信号的自下而上分析与自上而下的场景分割相结合。
在颞中部区域内侧颞五级视觉皮)中观察到了猴子中局部运动信号的整合,该区域是专门运动的
区域。该区域的神经元可以选择整体模式的特定运动方向,而不是模式的单个组成部分。这种对整体模式的依
赖性也可以在理发量效应中与感知方向的响应的对应关系中看到。
23.4 上下文决定视觉刺激的感知
23.4.1 亮度和颜色感知取决于上下文
视觉系统通过比较来自视野的不同部分到达的光来测量目标的表面特征。结果,对亮度和颜色的感知高度
取决于上下文。实际上,感知到的亮度和颜色可能与物体的物理特性所期望的完全不同。同时,如图 23.4.1A
示,即使照亮它们的光的亮度和波长分布从天然光到人造光,从阳光到阴影,或从黎明到中午,即使光的亮度和
波长分布也会显得相似。
当我们四处移动或随着环境照明的变化,物体的视网膜图像(大小,形状和亮度)也会发生变化。然而,
大多数情况下,我们不认为目标本身正在改变。当我们从光明的花园转移到一个昏暗的房间时,到达视网膜的
光强度可能会有所不同。无论是在房间的昏暗照明中,还是在阳光的眩光中,我们仍然看到一件白色衬衫,如白
色和红色的领带。同样,当朋友向你走去时,她被认为越来越近。即使视网膜上的图像确实扩大,您也不会认为
她会变得更大。我们将物体的大小和颜色感知为常量的能力再次说明了视觉系统的一个基本原理:它不像相机
那样被动地记录图像,而是使用视网膜的瞬时和可变刺激来构建稳定的三维世界。
上下文影响的另一个例子是颜色诱导,其中颜色在一个区域中的出现在相邻区域中转向颜色。形状在表面
亮度的感知中也起着重要作用。由于视觉系统假设照明来自上方,因此折叠表面上的灰色斑块躺在表面的顶部
或底部,即使它们实际上是相同的灰色阴影(图 23.4.1b,它们也会大不相同(图 23.4.1b
视觉皮层中某些神经元的反应与感知的亮度相关。大多数视觉神经元对表面边界有反应。视网膜神经节细
胞和统一神经元的感受野的中心仪结构适合捕获边界。大多数此类细胞对表面的内部部分没有反应,因为均匀
488
23.4 上下文决定视觉刺激的感知
a
b
c
d
e
a
b
c
靶细胞
d
e
偏好方向 偏好方向
兴奋性突触
后电位求和
门限
靶细胞中的
脉冲反应
e
d
c
b
a
ab
de
bc
刺激开始
23.3.1: 动的方向选择性。神经元对运动方向的选择性取决于突触前神经元相对于刺激开始的反应潜伏期。
突触前神经元 a b 的反应潜伏期比神经元 d e 的反应潜伏期稍长。当刺激从左向右移动时,神经元 a b
先被激活,但由于它们的反应潜伏期较长,它们的输入到达目标神经元与神经 d e 的输入叠加,叠加的
入导致神经元激活。相反,向左移动的刺激产生的信号会在不同时间到达目标神经元,因此不会达到细胞的放
电阈值
[197]
489
23.4 上下文决定视觉刺激的感知
556 Part IV / Perception
Figure 23–11 The aperture problem and barber-pole illusion.
A.Although an object moves in one direction, each component
edge when viewed through a small aperture appears to move
in a direction perpendicular to its orientation. The visual system
must integrate such local motion signals into a unified percept
of a moving object.
B.Gratings are used to test whether a neuron is sensitive to
local or global motion signals. When the gratings are superim-
posed and moved independently in different directions, one
does not see the two gratings sliding past each other but rather
a plaid pattern moving in a single, intermediate direction. Neu-
rons in the middle temporal area of monkeys are responsive to
such global motion rather than to local motion.
C.Motion perception is influenced by scene segmentation
cues, as seen in the barber-pole illusion. Even though the pole
rotates around its axis, one perceives the stripes as moving
vertically, due to the global vertical rectangle surround of the
barber pole enclosure.
B
C
+=
全局方向/目标方向
局部方向/组件方向
幻觉方向
A
that illuminates them changes from natural to artificial
light, from sunlight to shadow, or from dawn to mid-
day (Figure 23–12A).
As we move about or as the ambient illumination
changes, the retinal image of an object—its size, shape,
and brightness—also changes. Yet under most condi-
tions, we do not perceive the object itself to be chang-
ing. As we move from a brightly lit garden into a dimly
lit room, the intensity of light reaching the retina may
vary a thousandfold. Both in the room’s dim illumina-
tion and in the sun’s glare, we nevertheless see a white
shirt as white and a red tie as red. Likewise, as a friend
walks toward you, she is seen as coming closer; you do
not perceive her to be growing larger even though the
image on your retina does expand. Our ability to per-
ceive an object’s size and color as constant illustrates
again a fundamental principle of the visual system: It
does not record images passively, like a camera, but
instead uses transient and variable stimulation of the
retina to construct representations of a stable, three-
dimensional world.
Another example of contextual influence is color
induction, whereby the appearance of a color in one
region shifts toward that in an adjoining region. Shape
also plays an important role in the perception of sur-
face brightness. Because the visual system assumes
that illumination comes from above, gray patches on a
folded surface appear very different when they lie on
Kandel-Ch23_0545-0563.indd 556 09/12/20 4:50 PM
23.3.2: 径问题和理发杆错觉。A. 然一个物体在一个方向上移动,但当通过一个小孔径观察时,每个组
件边缘似乎都在垂直于它的方向的方向上移动。视觉系统必须将这些局部运动信号整合到对移动物体的统一感
知中。B. 光栅用于测试神经元是否对局部运动信号或全局运动信号敏感。当光栅叠加并沿不同方向独立移动时,
人们不会看到 2 个光栅相互滑过,而是看到格子图案在一个中间方向上移动。猴子中颞区的神经元对这种整
运动而不是局部运动有响应。C. 运动感知受场景分割线索的影响,如理发师杆错觉所示。尽管杆子围绕其轴旋
转,但由于理发杆外壳的全局垂直矩形环绕,人们会认为条纹是垂直移动的。
490
23.4 上下文决定视觉刺激的感知
Chapter 23 / Intermediate-Level Visual Processing and Visual Primitives 557
Figure 23–12 Color and brightness perception depend on
contextual cues.
A.Perceived surface colors remain relatively stable under dif-
ferent illumination conditions and the consequent changes in
wavelength of the light reflected from the surface. The yellow
squares on the left and right cubes appear similar despite the
fact that the wavelengths of light coming from the two sets
of surfaces are very different. In fact, if the blue squares on
the top of the left cube and the yellow squares on the top of
the right cube are isolated from their contextual squares, their
colors appear identical. (Reproduced, with permission, from
www.lottolab.org.)
B.Brightness perception is also influenced by three-dimensional
shape. The four gray squares indicated by arrows all have the
same luminance. The apparent brightnesses are similar in
the left illustration but different in the right illustration. This is
because the visual system has an inherent expectation that
illumination comes from above (the position of the sun relative
to us), and thus the perception that the surface below the fold
in the illustration on the right is brighter than the surface of the
same luminance that lies above. (Reproduced, with permission,
from Adelson 1993. Copyright © 1993 AAAS.)
孤立的
蓝色块
立的
黄色块
A
B
Kandel-Ch23_0545-0563.indd 557 09/12/20 4:50 PM
23.4.1: 颜色和亮度感知取决于上下文提示。A. 感知的表面颜色在不同的光照条件下保持相对稳定,因此从表
面反射的光的波长也会发生变化。尽管来自两组表面的光波长非常不同,但左右立方体上的黄色方块看起来很
相似。事实上,如果将左侧立方体顶部的蓝色方块和右侧立方体顶部的黄色方块与其上下文方块隔离开来,它
们的颜色看起来是相同的。B. 亮度感知也受三维形状的影响。箭头指示的四个灰色方块都具有相同的亮度。左
图中的视亮度相似,而右图中的视亮度不同。这是因为视觉系统有一个固有的期望,即照明来自上方(太阳相
对于我们的位置),因此认为右侧插图中褶皱下方的表面比上方相同亮度的表面更亮
[198]
491
23.5 皮层连接、功能架构和感知密切相关
的内部内部没有产生跨感受野的对比梯度。但是,一小部分神经元确实对表面的内部,信号局部亮度,质地或颜
色做出了反应,这些神经元的反应受到情境影响。即使在感受野内的表面亮度保持固定时,细胞的亮度也会随
着细胞感受野外面的亮度的变化而变化。
由于大多数神经元对表面边界的反应,而不是对均匀亮度的区域的反应,因此视觉系统通过有关表面边缘
的对比度的信息来计算表面的亮度。大脑对边界信息表面质量的分析称为感知填充。如果将黑盘和周围明亮区
域之间的边界固定几秒钟,则磁盘将以与周围区域相同的亮度“填充”之所以发生这种情况,是因为仅当眼睛
或刺激移动时,响应边缘的细胞就会发射。他们逐渐停止响应稳定的图像,不再向边界的存在发出信号。磁盘内
具有感受野的神经元逐渐开始以类似于周围区域具有感受野的方式的方式响应,表现出其感受野特性的短期可
塑性。
尽管在不同的照明条件下,从物体反射的光的波长分布差异很大,但目标的颜色总是显得或多或少。要识
别一个目标,我们必须知道其表面的属性,而不是不断变化的反射光的属性。因此,目标颜色的计算比分析反射
光谱更为复杂。为了确定表面的颜色,必须确定入射光的波长分布。在没有这些信息的情况下,可以通过确定场
景中不同表面的波长平衡来估算表面颜色。如果感知的颜色保持恒定,则四级视觉皮层中的一些神经元对不同
的照明波长的反应类似。通过对广泛表面的光响应,这些神经元可以选择表面颜色而不是波长。
23.4.2 感受野属性取决于上下文
局部和全球效应之间的区 - 感受野中发生的刺激与超越刺激之间的区别在于如何定义感受野本身的
题。由于视觉皮层神经元的感受野的原始表征没有考虑到上下文的影响,因此一些研究人员现在区分“经典”
“非经典”感受野。
但是,即使对感觉感受野的最早描述也可以从狭义定义的感受野外面的感觉表面的某些部分影响。1953 年,
史蒂文 · 库夫勒在对视网膜神经节细胞的感受野特性的开创性观察中,指出:“不仅可以通过视网膜照明来真正
设置响应的区域同样,通过对神经节细胞的抑制或兴奋性影响,所有显示功能连接的区域。这很可能涉及与神
经节细胞有些遥远的区域,并且本身不会设置排放。
更有用的区别与神经元对简单刺激(例如短线段)的响应与对具有多个组分的刺激的响应进行了对比。即
使在主要的视觉皮层中,神经元也是高度非线性的。他们对复杂刺激的反应无法从对视野周围不同位置的简单
刺激的反应中预测。相反,它们对本地特征的响应取决于嵌入功能的全局上下文。上下文影响在中层视觉处理
中普遍存在,包括轮廓集成,场景分割以及目标形状,目标运动和表面属性的确定。
23.5 皮层连接、功能架构和感知密切相关
中层视觉处理需要在整个视野中共享信息。初级视觉皮层中的互连与该区域功能结构的关系表明该回路介
导了轮廓积分。
皮层回路包括由平行于皮层表面的锥体神经元的轴突形成的远程水平连接的丛。水平连接都存在于大脑皮
层的每个区域中,但是它们的功能在一个区域到下一个区域而异,具体取决于每个区域的功能架构。如图 21.6.2
示,在视觉皮层中,这些连接介导了相似特异性的方向列之间的相互作用,从而在大面积的视觉皮层上集成了
信息,这代表了视野的大广阔。
这些水平连接链接神经元相似但代表视野中遥远位置的神经元的事实表明,这些连接在轮廓积分中起作用。
轮廓整合和轮廓显著性的相关特性反映了良好延续的格式塔原理。如 23.1.4 示,两者都是由初级视觉皮
中的水平连接介导的。
对视觉空间整合至关重要的皮层连接的最终特征是高阶皮层区域的反馈预测。反馈连接与起源于丘脑或皮
层加工的早期阶段的前馈连接一样广泛。这些反馈预测的功能知之甚少。他们可能在调解自上而下的关注,期
望和感知任务的影响方面发挥作用,所有这些都会影响皮层处理的早期阶段。
492
23.5 皮层连接、功能架构和感知密切相关
23.5.1 感知学习需要皮层连接的可塑性
眼部占主导地位的突触连接仅适用于在开发的关键时期(第 49 章)。这表明视觉皮层神经元的功能特性
在成年中固定的。然而,皮层神经元的许多特性在一生中仍然可变。例如,视网膜病变后可能发生视觉皮层的变
化。
当局灶性病变发生在 2 视网膜上的相应位置时,最初被剥夺了视觉输入的皮层图的相应部分(称为病
投射区)然而,在几个月的时间里,该区域内的细胞的感受野从视网膜的病变部分转移到了病变周围的功能区
域。结果,如图 23.5.1 所示,视网膜病变部分的皮层表示,周围区域的皮层表示。
560 Part IV / Perception
Figure 23–13 Adult cortical plasticity.When corresponding
positions in both eyes are lesioned, the cortical area receiving
input from the lesioned areas—the lesion projection zone—is
initially silenced. The receptive fields of neurons in the lesion
projection zone eventually shift from the area of the lesion to
the surrounding, intact retina. This occurs because neurons sur-
rounding the lesion projection zone sprout collaterals that form
synaptic connections with neurons inside the zone. As a result,
the cortical representation of the lesioned part of the retina
shrinks while that of the surrounding retina expands.
损伤
皮层
视网膜
两个月后
损伤
because the visual system processes simultaneously,
in parallel pathways, the features of the target and the
surrounding distractors (Figure 23–15). When the fea-
tures of a target are complex, the target can be identi-
fied only through careful inspection of an entire image
or scene.
The pop-out phenomenon can be influenced by
training. A stimulus that initially cannot be found
without effortful searching will pop out after training.
The neuronal correlate of such a dramatic change is not
certain. Parallel processing of the features of an object
and its background is possible because feature infor-
mation is encoded in retinotopically mapped areas at
multiple locations in the visual cortex. Pop-out prob-
ably occurs early in the visual cortex. The pop-out of
complex shapes such as numerals supports the idea
that early in visual processing neurons can represent,
and be selective for, shapes more complex than line
segments with a particular orientation.
Cognitive Processes Influence Visual Perception
Scene segmentation—the parsing of a scene into differ-
ent objects—involves a combination of bottom-up pro-
cesses that follow the Gestalt rule of good continuation
and top-down processes that create object expectation.
One strong top-down influence is spatial atten-
tion, which can change focus without any movement
of an observer’s eyes. Spatial attention can be object-
oriented in that the focus of attention is distributed
over the area occupied by the attended object, allowing
the visual cortex to analyze the shape and attributes of
objects one at a time.
Attentional mechanisms can solve the superposition
problem. Before we can recognize an object in a scene that
includes many objects, we must determine which features
correspond to which objects. Our sense that we identify
all objects in the visual field simultaneously is illusory.
Instead, we serially process objects in rapid succession by
Kandel-Ch23_0545-0563.indd 560 09/12/20 4:50 PM
23.5.1: 成人皮层的可塑性。当双眼的相应位置受损时,接收来自受损区域输入的皮层区域(病变投射区)最
初是沉默的。病变投射区神经元的感受野最终从病变区域转移到周围完整的视网膜。发生这种情况是因为病变
投射区周围的神经元会萌发侧枝,这些侧枝与区域内的神经元形成突触连接。结果,视网膜受损部分的皮层表
现缩小,而周围视网膜的皮层表现扩大。
皮层图和连接的可塑性不是作为对病变的反应而发展的,而是改善我们感知技能的神经机制。视觉皮层分
析的许多属性,包括立体敏锐度,运动方向和方向,随着实践而变得更加清晰。赫尔曼 · · 亥姆霍兹 1866
表示:“感官的判断可以通过经验和在各种情况下得出的训练来修改,并且可以适应新条件。因此,人们可以在
某种程度上学习以利用这种感觉的细节,否则这些感觉将避开通知,而不会有助于获得目标的任何想法。这种
感知学习是多种隐性学习,不涉及有意识的过程(第 52 章)
感知学习涉及多次重复辨别任务,并且不需要错误反馈来提高性能。例如,改善表现出来,例如,在目标刺
激的属性或在复杂的环境中检测目标的能力差异的阈值减少。包括初级视觉皮层在内的几个视觉皮层区域参与
493
23.6 亮点
感知学习。
感知学习的一个重要方面是其特异性:对一项任务的训练不会转移到其他任务。例如,在三行二等任务中,
受试者必须确定 3 个平行线的中心位置是更接近左侧的线还是右侧的线。重复练习后,准确响应所需的中心
置的偏移量大大减少。
学习此任务是特定于视野中的位置和线路的方向。这种特异性表明,视觉处理的早期阶段是造成的,因为在
早期阶段,感受野是最小的,视觉图最精确的,并且定向调整最清晰。该学习也针对刺激配置。如图 23.5.2a
示,对三线一分为二的训练不会转移到装游标的任务任务中,其中上下文是与目标线共线的线。
在感知学习过程中,初级视觉皮层中神经元的响应特性以跟踪感知改善的方式。一个例子以轮廓显著性。
过实践,受试者可以更容易地检测嵌入在复杂背景中的轮廓。检测随轮廓长度改善,初级视觉皮层中神经元的
反应也是如此。如图 23.5.2b 所示,通过实践,受试者提高了检测到较短轮廓的能力,而初级视觉皮层神经元对
较短的轮廓更敏感。
23.5.2 视觉搜索依赖于视觉属性和形状的皮层表示
诸如颜色,方向和形状之类的特征的可检测性与视觉搜索过程有关。如图 23.5.3 所示,在复杂的图像中,
些目标脱颖而出或“弹出”因为视觉系统以并行通路,目标和周围干扰器的特征同时处理。当目标的功能复杂
时,只能通过仔细检查整个图像或场景来识别目标。
弹出现象可能会受到训练的影响。在训练后,如果没有努力搜索,最初就无法发现的刺激。这种戏剧性变化
的神经元相关性不确定。对物体及其背景的特征进行并行处理是可能的,因为特征信息编码在视觉皮层多个位
置的视网膜局部映射区域中。弹出可能发生在视觉皮层的早期。诸如数字之类的复杂形状的弹出支持的想法是,
在视觉处理的早期神经元可以代表并具有选择性的形状比具有特定方向的线段更为复杂。
23.5.3 认知过程影响视觉感知
场景细分(场景分解为不同的目标)涉及到自下而上的过程的组合,这些过程遵循良好延续的格式塔规则
和创造目标期望的自上而下的过程。
一种强大的自上而下的影响是空间注意力,它可以改变注意力的注意力而不会动人的眼睛动作。空间注意
力可能是面向目标的,因为注意力的焦点分布在受访目标占据的区域上,从而使视觉皮层一次分析一个目标的
形状和属性。
注意机制可以解决叠加问题。在我们可以在包含许多目标的场景中识别一个目标之前,我们必须确定哪些
功能与哪些目标相对应。我们的感觉是,我们同时识别视野中的所有目标是虚幻的。取而代之的是,我们通过将
注意力从一个转移到下一个,以快速继承目标进行序列处理。每个分析的结果都建立了对带有许多不同目标的
复杂环境的感知。对物体识别中注意力重要性的戏剧性证明变化。如图 25.2.1 所示,如果一个主题在同
场景的 2 个略有不同的视图之间迅速移动,他将无法在一个视图中发现场景中没有重要组成部分的情况而没
进行大量审查。
另一个自上而下的影响是感知任务。在视觉处理的早期阶段,同一神经元的特性随着视觉辨的类型而变
化。目标识别涉及一个假设检验的过程,其中将来自视网膜到达的信息与目标的内部表示。该过程反映在研究
表明,当没有视觉输入而想象场景的情况下,加工的早期阶段(例如主要的视觉皮层)会激活。
23.6 亮点
1. 视觉需要将目标与背景分离,这是一个涉及轮廓集成和表面分割的过程。
2. 通过依靠自然形式的统计特性来简化此过程。正如 20 世纪初格斯塔尔特心理学家所认可的那样,我们自
然地将场景组件基于相似性,接近性和轮廓平滑度的分组(称为“良好的延续”
494
23.6 亮点
Figure 23–14 Perceptual learning.Perceptual learning is a
form of implicit learning. With practice, one can learn to dis-
criminate smaller differences in orientation, position, depth, and
direction of movement of objects.
A.The improvement is seen as a reduction in the amount of
change required to reliably detect a tilted line or one posi-
tioned to the left or right of a nearly collinear line (vernier
task). Perceptual learning is highly specific, so that training
on a three-line bisection task leads to substantial improve-
ment in that task (left pair of bars in the bar graph) without
affecting performance on the vernier discrimination task
(central pair of bars). However, training specifically on vernier
discrimination does enhance performance on that task (right
pair of bars).
B.Subjects can detect collinear line segments embedded in a
random background more easily as the number of collinear seg-
ments is increased. The responses of neurons in V1 grow cor-
respondingly stronger with the increase in the number of line
segments. After practice, a line with fewer segments stands
out more easily, and with this improvement, the responses in
V1 also increase. (Reproduced, with permission, from Crist, Li,
and Gilbert 2001; Li, Piech, and Gilbert 2008.)
5
4
3
2
1
0
训练前
训练后
0.5
0.6
0.7
0.8
0.9
1.0
19
7
3 5
共线数
感知显著性
神经元调谐
感知显著性
神经元调谐
三线对分任务
方位辨别
游标视敏度任务
门限 (min. of arc)
轮廓检测概率
Wk 1–2
Wk 3–4
训练
任务 任务任务
性能
5个共线
9个共线
A 知觉学习的任务特异性
B 训练过程中神经元响应性的变化
轮廓检测任务
Kandel-Ch23_0545-0563.indd 561 09/12/20 4:50 PM
23.5.2: 知觉学习。知觉学习是内隐学习的一种形式。通过练习,人们可以学会辨别物体在方位、位置、深度
和运动方向上的微小差异。A. 这种改进被视为可靠地检测倾斜线或位于几乎共线的线(游标任务)左侧或右侧
所需的变化量的减少。知觉学习是高度特定的,因此三线平分任务的训练会导致该任务(条形图中左侧的一对
条形)的显著改进,而不会影响游标辨别任务(中央一对条形)的性能。然而,专门针对游标辨别的训练确实可
以提高该任务的表现(右图)B. 随着共线段数量的增加,受试者可以更容易地检测到嵌入在随机背景中的共线
线段。随着线段数量的增加,初级视觉皮层神经元的反应相应增强。经过练习,分段越少的线越容易脱颖而出,
随着这种改进,初级视觉皮层中的响应也增加了
[199-200]
495
23.6 亮点
562 Part IV / Perception
Figure 23–15 One object in a complex image stands
out under certain conditions.
A.A differently colored object pops out.
B.A differently oriented line also pops out.
C.More complex shapes can pop out when they are
very familiar, such as the numeral 2 embedded in a
field of 5s. Rotating the image by 90° renders the ele-
ments of the figure less recognizable, making it more
difficult to find the one figure that differs from the rest.
(Reproduced, with permission, from Wang, Cavanagh,
and Green 1994. Copyright © 1994 Springer Nature.)
A
颜色
C
熟悉的形状
B
朝向
shifting attention from one to the next. The results of each
analysis build up the perception of a complex environ-
ment populated with many distinct objects. A dramatic
demonstration of the importance of attention in object
recognition is change blindness. If a subject rapidly shifts
between two slightly different views of the same scene,
he will not be able to detect the absence of an important
component of the scene in one view without considerable
scrutiny (see Figure 25–8).
Another top-down influence is perceptual task. At
early stages in visual processing, the properties of the
same neuron vary with the type of visual discrimina-
tion being performed. Object identification involves
a process of hypothesis testing in which information
arriving from the retina is compared with internal rep-
resentations of objects. This process is reflected in stud-
ies that have shown that early stages in processing,
such as the primary visual cortex, are activated when
scenes are imagined without visual input.
Highlights
1. Vision requires segregating objects from their
backgrounds, a process involving contour integra-
tion and surface segmentation.
2. This process is simplified by relying on the sta-
tistical properties of natural forms. As recog-
nized by the Gestalt psychologists early in the
20th century, we naturally link scene components
based on grouping rules of similarity, proximity,
and contour smoothness (referred to as “good
continuation”).
3. Neurons in visual cortical areas have properties
consonant with Gestalt grouping rules. They per-
form a local and global analysis of scene proper-
ties in parallel. The local properties are the visual
primitives, which include orientation selectivity,
direction selectivity, contrast sensitivity, disparity
selectivity, and color selectivity. The correspond-
ing global properties include contour integration,
object movement, border ownership, disparity
capture, and color constancy.
4. Perception of visual features is dependent on
context; similarly, neuronal responses are con-
text dependent. The principle underlying these
interactions is the association field, a pattern of
interactions between bits of information that are
mapped across each cortical area. The association
field mediates contour integration in visual cortex
but is likely to be a general feature of processing
throughout the cerebral cortex. The anatomical
Kandel-Ch23_0545-0563.indd 562 09/12/20 4:50 PM
23.5.3: 复杂图像中的一个目标在特定条件下脱颖而出。A. 弹出一个不同颜色的物体。B. 一条不同方向的线也
会弹出。C. 当他们非常熟悉时,更复杂的形状会跳出来,比如数字 2 嵌入在 5 的区域中。将图像旋转 90 度会使
图形的元素难以辨认,从而更难找到与其他图形不同的图形
[201]
496
23.6 亮点
3. 视觉皮层区域中的神经元具有与格式塔分组规则相辅相成的。他们并行地对场景属性进行局部和全局分
析。局部属性是视觉原始要素,其中包括方向选择性,运动方向选择性,对比灵敏度,视差选择性和颜色选择
性。相应的全局属性包括轮廓集成,目标运动,边界所有权,视差捕获和颜色恒定性。
4. 视觉特征的感知取决于上下文;同样,神经元反应依赖上下文。这些相互作用的基本原则是关联区域,
是每个皮层区域映射的信息位之间的相互作用模式。关联场介导视觉皮层中的轮廓整合,但可能是整个大脑皮
层加工的一般特征。关联场的解剖基板包括由皮锥体细胞突形成的远程水平连接网络,该水平连接的长距
离延伸至平行于皮层表面。
5. 不同的视觉皮层区域有助于各种全局属性,并且其开发需要区域之间的相互作用,包括自上而下的影响。
虽然在皮层区域的层次结构中,通过从初级视觉皮层向颞叶(腹侧通路)和顶叶(背侧通路)皮层延伸的前馈连
接强调了对于不断增加的刺激复杂性的选择性,但反馈连接同样重要。
6. 未来的研究将在皮层加工中阐明内在,前馈和反馈皮层连接及其之间的相互作用的相对贡献。有证据
明,神经元不是具有固定功能,而是自适应处理器,在不同的行为背景下扮演不同的功能角色。神经元可以通过
输入选择,表达与任务相关的输入并抑制任务无关的输入来介导这种功能多样性。当异常运行时,这些功能和
连通性动态可能会解释与孤独症和精神分裂症等疾病相关的感知和行为现象。
497